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1BH CENTER 1600/2900 



ED 



3 <110> APPLICANT: Gilbert, Michel 

4 Young, N. Martin 

5 Wakarchuk, Warren W. 

6 National Research Council o£ Canada 

8 <120> TITLE OF INVENTION : fusion Proteins for Use in Enzymatic Synthesis of 

9 Oligosaccharides 

1.1 <130> FILE REFERENCE': 01995 7 -01 291 OUS 

13 <140> CURRENT APPLICATION NUMBER: US 09/211,691 

14 <141> CURRENT FILING DATE: 1998-12-14 

16 <150> PRIOR APPLICATION NUMBER: US 60/069,443 

17 <151> PRIOR FILING DATE: 1997-12-15 
19 <160> NUMBER OF SEQ ID NOS : 18 

21 <1 7 0> SOFTWARE: Patentln Ver. 2.1 

23 -<210> SEQ ID NO: 1 

24 <211> LENGTH: 828 . 

25 <212> TYPE: DNA 

26 <213> ORGANISM : Neisseria meningitidis " 

28 <220> FEATURE: 

29 <221> NAME/KEY: CDS 

30 <222> LOCATION: (1)..(828) 

31 <223> OTHER INFORMATION: beta - 1 , 4 - ga la ctosy ltra 
3 3 % <4 00> SEQUENCE: 1 

34 atg caa aac cac gtt ate age tta get tec gec gca 

3 5 Met Gin Asn His Val lie Ser Leu Ala Ser Ala Ala 
36 1 5 .10 

38 cac att gec cjat acc ttc ggc agg cac ggc ate ccg 

39 His He Ala Asp Thr Phe Gly Arg His Gly He Pro 

4 0 20 .25 
42 gac gca ctg atg ccg tct gaa. agg ctg gaa cag gca 
4 3 Asp Ala- Leu Met Pro Ser Glu Arg Leu Glu Gin Ala 
■4 4 3 5 4 0 
46 gtc ccc ggc ttg teg gcg cac ecc tat ttg age gga 
4 7 Val Pro Gly Leu Ser Ala His Pro Tyr Leu Ser Gly 

40 50 55 60 

50 tgc ttt atg age cac gec gta ttg tgg aag cag gca ttg 

51 Cys Phe Met Ser His Ala Val Leu Trp Lys Gin Ala Leu 

52 65 70 75 

54 ctg ccg tat ate acc gta ttt gag gac gac gtt tta etc 

55 Leu Pro Tyr He Thr Val Phe Glu Asp . Asp Val Leu Leu 

56 85 90 

58 gag gaa aaa ttc ctt gec gaa gac get tgg ctg caa gaa 

59 G.lu Glu Lys Phe Leu Ala Glu Asp Ala Trp Leu Gin Glu 

60 100 105 
'62 ccg gat acc gec ttt ate gtc cgc ttg gaa a eg atg ttt 
6 3 Pro Asp Thr Ala Phe He Val Arg Leu Glu Thr Met Phe 
64 ' 115 120 125 
66 ctg acc teg ccc tec ggc gtg gcg gat tac tgc ggg cgc 



nsf erase ( lgtB ) 

gaa cgc agg gcg 
Glu Arg Arg Ala 
15 

ttt cag ttt ttc 
Phe Gin Phe Phe 
30 

■atg gcg gaa etc 
Met Ala Glu Leu 
4 5 

gtg gaa aaa gec 
Val Glu Lys Ala 



gac gaa ggt 
Asp G.lu Gly 
80 . 

ggc gaa ggt 
Gly Glu Gly 
95 

cgc ttt gac 
Arg Phe Asp 
110 

atg cac gtc 
Met His val 



48 



96 



141 



192 



240 



288 



■ 336 



384 



gec ttt ccg 432 
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67 Leu 


Thr 


Ser 


Pro 


Ser 


Gly 


Val 


Ala' 


Asp 


Tyr 


Cys 


Gly 


Arg 


Ala 


Phe 


Pro 




68 


130 










135 










140 












70 ctg 


ttg 


gaa 


age 


gaa 


cac 


tgg 


9gg 


acg 


f jcg 




tat 


ate 


att 


tec 


eg a 


4 80 


71 Leu 


Leu 


Glu 


Ser " 


Glu 


His 


Trp ■ 


Gly 


Thr 


Ala 


Gly 


Tyr 


He 


lie 


Ser 


Arg 




72 14 5 










150 










155 










160 




74 aaa 


gcg 


atg 


egg 


ttt 


ttc 


ctg 


gac 


agg 


ttt 


gec 


gec 


ctg 


ccg 


ccc 


gaa 


528 


7 5 Lys 


Ala 


Met 


Arg 


Phe 


Phe 


Leu 


Asp 


Arg 


Phe 


Ala 


Ala 


Leu 


Pro 


Pro 


Glu 




76 








165 










170 










175 






78 ggg 


ctg 


cac 


ccc 


gtc 


gat 


ctg 


atg 


atg 


ttc 


age 


gat 


ttt 


ttc 


gac 


agg 


576 


7 9 Gly 


Leu 


His 


Pro 


val 


Asp 


Leu 


I<kit 


Met. 


Phe 


Ser 


Asp 


Plie 


Phe 


Asp 


Arg 




30 






180 










185 










190 








82 gaa 


erg a 


atg 


ccg 


gtt 


tqc 


cag 


etc 


aat 


ccc 


gec 


ttg 


tgc 


gec 


caa 


gag 


624 


83 Glu 


Gly 


Met 


Pro 


Va 1 


Cys 


Gin 


Leu 


Asn 


Pro 


Ala 


Leu 


Cys 


Ala 


Gin 


Glu 




84 




195 










200 










205 










86 ctg 


cat 


tat 


gcC' 


aag ■ 


ttt 


cac 


gac 


caa 


aac 


age 


gca 


ttg 


ggc 


age 


ctg 


672 


87 Leu 


His 


Tyr 


Ala 


Lys 


Phe 


His 


Asp 


Gin 


Asn 


Ser 


Ala 


Leu 


Gly 


Ser 


Leu 




88 


210 










215 










220 












90 ate 


gaa 


cac 


gac 


cgc 


etc 


ctg 


aac 


cgc 


aaa t 


cag 


caa 


agg 


cgc 


gat 


tec 


720 


91 He 


Glu 


His 


Asp 


Arg 


Leu 


Leu 


Asn 


Arg 


Lys 


Gin 


Gin 


Arg 


Arg 


Asp 


Ser 




92 225 










230 










235 










240 




94 ccc 


gec 


aac 


aca 


ttc 


aaa 


cac 


cgc 


ctg 


ate 


cgc 


gec 


ttg 


ace 


aaa 


ate 


768 


95 Pro 


Ala 


Asn 


Thr 


Phe . 


Lys 


His 


Arg 


Leu 


He 


Arg 


Ala 


Leu 


Thr 


Lys 


He 




96 








245 










250 










255 






98 a.gc 


agg 


gaa 


agg 


gaa 


aaa 


cgc 


egg 


caa 


agg 


cgc 


gaa 


cag 


ttc 


att 


gtg 


816 


99 Ser 


Arg 


G.lu 


Arg 


Glu 


Lys 


Arg 


Arg 


Gin 


Arg 


Ary 


Glu 


Gin 


Phe 


He 


Val 




100 ■ 






260 


/ 








265 










270 








.10 2 cct 


ttc 


caa 


taa 
























B2f 


103 Pro 


Plie 


Gin 






























104 




275 






























107 <210> SEQ ID NO 


: 2 


























108 <211> LENGTH: 2 


75 


























109 <212> TYPE: 


PRT 




























110 <213> ORGANISM; 


Neisseria meningitidis 
















112 <400> SEQUENCE: 


2 


























113 Met 


Gin 


Asn 


His 


Val 


lie 


ser 


Leu 


Ala 


Ser 


Ala 


■ Ala 


Glu 


Arg 


Arg 


Ala 




114 1 








5 










10 










15 






115 His 


lie 


Ala 


Asp 


Thr 


Phe 


' Gly 


Arg 


His 


Gly 


He 


Pro 


Phe 


Gin 


Phe 


Phe 




116 






20 










25 










30 








117 Asp 


Ala 


Leu 


Met 


Pro 


Ser 


GlU 


Arg 


Leu 


Glu 


Gin 


Ala 


Met 


Ala 


Glu 


Leu 




118 




35 










4 0 










45 










119 Val 


Pro 


' Gly 


Leu 


Ser 


Ala 


His 


Pro 


Tyr 


Leu 


Ser 


Gly 


Val 


Glu 


Lys 


Ala 




120 


50 










' S5 










60 












121 Cy.s 


Phe 


.Met 


Ser 


His 


Ala 


Val 


Leu 


Trp 


Lys 


Gin 


Ala 


Leu 


Asp 


Glu 


Gly 




122 65 










70 










75 










80 




123 Leu 


Pro 


Tyr 


lie 


Thr 


Val 


Phe 


Glu 


Asp 


Asp 


Val 


Leu 


Leu 


Gly 


Glu 


Gly 




124 








85 










90 










95 






125 Glu 


Glu 


Lys 


Phe 


Leu 


Ala 


Glu 


Asp 


Ala 


Trp 


Leu 


Gin 


Glu 


Arg 


Phe 


Asp 




126 






100 










105 










110 








127 Pro 


Asp 


Thr 


Ala 


Phe 


He 


Val 


. Arg 


Leu 


Glu 


Thr 


Met 


• Phe 


Met 


His 


Val 
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128 115 120 125 

129 Leu Thr Ser Pro Ser Gly val Ala Asp Tyr Cys Cly Arg Ala Phe Pro 

130 130 135 :i 4 0 ' 

131 Leu Leu Glu Ser Glu His Trp Gly Thr: Ala Cly Tyr lie lie Ser Arg 

132 145 150. , 155 160 

133 Lys Ala Met Ary Phe Phe Leu Asp Arg Phe Ala Ala Leu Pro Pro Glu 

134 . 165 1.7 0 175 

135 Gly Leu His Pro Val Asp Leu Met Met; Phe Ser Asp Phe Phe Asp Arg 

136 180 105 190 

137. Glu. Gly Met Pro Val Cys Gin Leu Asn Pro Ala Leu Cys Ala Gin Glu 
133 195 200 205 

139 Leu His Tyr Ala Lys Phe His Asp Gin Asn Ser Ala Leu Gly Ser Leu 

140 210 2.15 22 0 

.141 lie Glu His Asp Arg Leu Leu Asn Arg Lys Gin Gin Arg Arg Asp Ser 

142 225 230 235 240 

.143 Pro Ala Asn Thr Phe Lys His Arg Leu Tie Arg Ala Leu Thr Lys lie. 
144 245 250 " 255 

14 5 Ser Arg Glu Arg Glu Lys Arg Arg Gin Arg Arg Glu Gin Phe He Val 

14 6 - 26 0 26 5 .270 

147 Pro Phe Gin 

148 27 5 

151 <210> SEQ ID NO: 3 

152 <21.1> LENGTH: 41 

153 <212> TYPE: DNA 

154 <213> OHGANTSM: Artificial Sequence 

156 <220> ' FEATURE : 

157 <223> OTHER INFORMATION; Description of Artificial Sequence : SYNTM-Fl 5' 

158 primer - 
160<400> SEQUENCE:' 3 

161 cttaggaggt catatggaaa aacaaaatat tgcggttata c 41 
J.64 <210> SEQ ID NO: 4 

165 <211> LENGTH: 4 5 

166 <212> TYPE: DNA ' 

167 <213> ORGANISM; Artificial Sequence 
16 9 <220> FEATURE: 

.170 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTM-R6 3' 
171 primer 

173 <4 0.0> SEQUENCE: 4 

174 cgacagaatt ccgccaccgc tttccttgtg attaagaatg ttttc 4 5 

177 <210> SEQ ID NO: 5 

178 <211> LENGTH :' 37 a 

179 <2'12> TYPE: DNA 

1H0 <213> ORGANISM: Artificial Sequence 

182 <220> FEATURE: 

183 <223> OTHER INFORMATION: Description of Artificial Sequence :SIALM-22F 5' 

184 primer 

18 6 <4 00> SEQUENCE: 5 

187 gcatggaatt ctgggcttga aaaaggcttg tttgacc 37 
190 <210> SEQ ID NO: 6 
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191 <211> LENGTH: 59 

192 <212> TYPE: DNA 

193 <213> ORGANISM: Artificial Sequence 

195 <220> FEATURE: 

196 <223> OTHER INFORMATION : Description of Artificial Sequence : SIALM-23K 3' 

197 primer 

199 <4 00> SEQUENCE: 6 

200 octaggtcga ctoattagtg gtgatggtgg tgatggttca ggtcttcttc gctgatcag 59 

203 <2I0> SEO ID NO: 7 

204 <211> LENGTH: 9 

205 <212> TYPE : PRT 

206 <213> ORGANISM: Artificial Sequence 

208 <220> FEATURE: 

209 <223> OTHER INFORMATION: Description of Artificial Sequence : Linker of 

210 , pFUS-01/2 
2.12 <40'6> SEQUENCE: 7 

213 Gly Gly Gly lie. Leu Ser His Gly lie 

214 1 .5 
217 <210> SEQ ID NO: 8 
218' <211> LENGTH: 0 

219 <212> TYPE: PRT 

2 20 <213> ORGANISM: Artificial Sequence 

222 <2 20> FEATURE: 

223 <223> OTHER INFORMATION : Description of Artificial Sequence : linker of 
2,24 pFUS-01/4 

226 <4 00> SEQUENCE: 8 . 

227 Gly Gly Gly I J.e Leu ' Ser Gly lie 

228 1 5 

231 <210> SEQ ID NO: 9 

232 <211> LENGTH: 58 
2 33 <212> TYPE-: . DNA 

234 <213> ORGANISM: Artificial Sequence 

236 <220> FEATURE: 

237 <223> OTHER INFORMATION: Description of Artificial Sequence : GalE- 5p. 5' 

238 primer 

240 <400> SEQUENCE: -.9 . 

241 gggacaggat ccatcgatgc ttaggaggtc atatggcaat tttagtatta ggtggagc 58 

244 <210> SEQ ID NO: 10 

245 <211> LENGTH: 42 

246 <212> TYPE : DNA 

247 <213> ORGANISM': Artificial Sequence 

249 <220> FEATURE: 

250 <223> OTHER INFORMATION: Description of Artificial Sequence : GalE- 3p 3* 

251 primer 

253 <400> SEQUENCE: 10 

254 gggggggcta gcgccgcctc ctcgatcatc gtaccctttt gg 4 2 
257 <210> SEQ ID NO: 11 

2 58 <211> LENGTH : 38 
259 <212> TYPE: DNA 
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260 <213> ORGANISM: Artificial Sequence 

262 <220> FEATURE: 

263 <223> OTHER INFORMATION: Description of Artificial Sequence : LytB- Nhel 5' 

264 primer 

266 <4 00> SEQUENCE: 11 

267 gygggggcta gcgtgcaaaa ccacgttatc agcttagc 38 

27 0 <210> SEQ ID NO: 12 

271 <211> LENGTH: 4 5 

272 <212> TYPE: DNA 

273 <213> ORGANISM: Artificial Sequence 

275 <220> FEATURE: 

276 <223> OTHER INFORMATION: Description of Artificial Sequence : LgtB-Sa II- 3 ' 

277 primer 

279 <400> SEQUENCE: 12 

280 ggygyyytcg acctattatt yyaaayycac aatyaactyt tcycy 4 5 

283 <2.10> SEQ ID NO: 13 

284 <211> LENGTH: 10 

28 5 <212> TYPE: PRT 

286 <213> ORGANISM: Artificial Sequence 

288 <220> FEATURE: 

289 <223> OTHER INFORMATION: Description of Artificial Sequence : peptide linker - 

291 <400> SEQUENCE: 13 ■ . 

292 Gly Gly Gly He Leu Ser His Gly He Leu 

293 1 5 10 '. 
296 <210> SEQ ID NO: 14 . 

■297 <211> LENGTH: 6 

298 <212> TYPE: PRT ' 

299 <213> ORGANISM: Artificial Sequence 

301 <220> FEATURE: ' 

302 <22 3> OTHER INFORMATION: Description of Artificial Sequence : 6 -His tail for 

303 purification 

305 <400> SEQUENCE: 14 

306 His His His His His His 

307 1 5 
310 <210> SEQ ID NO: 15 
,311 <2.11> LENGTH: 5 

312 <212> TYPE: PRT 

313 <213> ORGANISM: Artificial Sequence 

315 <22()> FEATURE: 

316 <22 3> OTHER INFORMATION: Description of Artificial Sequence : peptide linker 

318 <4 00> SEQUENCE: 15 ' 

319 Gly Gly Ala Ser Va.I 

320 1 ' 5, . 
323 <210> "SEQ ID NO: 16 

. 324 <211> LENGTH: '63 
.325 <212> TYPE: DNA 

326 <213> ORGANISM: Artificial Sequence 

328 <22 0> FEATURE: ' 

329' <223> OTHER INFORMATION: Description of Artificial Sequence : junction region 



\ 

■ \ 
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